A Novel Approach to Helicobacter pylori Pan-Genome Analysis for Identification of Genomic Islands
نویسندگان
چکیده
Genomes of a given bacterial species can show great variation in gene content and thus systematic analysis of the entire gene repertoire, termed the pan-genome, is important for understanding bacterial intra-species diversity, population genetics, and evolution. Here, we analyzed the pan-genome from 30 completely sequenced strains of the human gastric pathogen Helicobacter pylori belonging to various phylogeographic groups, focusing on 991 accessory (not fully conserved) orthologous groups (OGs). We developed a method to evaluate the mobility of genes within a genome, using the gene order in the syntenically conserved regions as a reference, and classified the 991 accessory OGs into five classes: Core, Stable, Intermediate, Mobile, and Unique. Phylogenetic networks based on the gene content of Core and Stable classes are highly congruent with that created from the concatenated alignment of fully conserved core genes, in contrast to those of Intermediate and Mobile classes, which show quite different topologies. By clustering the accessory OGs on the basis of phylogenetic pattern similarity and chromosomal proximity, we identified 60 co-occurring gene clusters (CGCs). In addition to known genomic islands, including cag pathogenicity island, bacteriophages, and integrating conjugative elements, we identified some novel ones. One island encodes TerY-phosphorylation triad, which includes the eukaryote-type protein kinase/phosphatase gene pair, and components of type VII secretion system. Another one contains a reverse-transcriptase homolog, which may be involved in the defense against phage infection through altruistic suicide. Many of the CGCs contained restriction-modification (RM) genes. Different RM systems sometimes occupied the same (orthologous) locus in the strains. We anticipate that our method will facilitate pan-genome studies in general and help identify novel genomic islands in various bacterial species.
منابع مشابه
Strain-specific genes of Helicobacter pylori: genome evolution driven by a novel type IV secretion system and genomic island transfer
The availability of multiple bacterial genome sequences has revealed a surprising extent of variability among strains of the same species. The human gastric pathogen Helicobacter pylori is known as one of the most genetically diverse species. We have compared the genome sequence of the duodenal ulcer strain P12 and six other H. pylori genomes to elucidate the genetic repertoire and genome evolu...
متن کاملPan-Genome Analysis of Human Gastric Pathogen H. pylori: Comparative Genomics and Pathogenomics Approaches to Identify Regions Associated with Pathogenicity and Prediction of Potential Core Therapeutic Targets
Helicobacter pylori is a human gastric pathogen implicated as the major cause of peptic ulcer and second leading cause of gastric cancer (~70%) around the world. Conversely, an increased resistance to antibiotics and hindrances in the development of vaccines against H. pylori are observed. Pan-genome analyses of the global representative H. pylori isolates consisting of 39 complete genomes are ...
متن کاملDraft Genome Sequences of Helicobacter pylori Strains HPARG63 and HPARG8G, Cultured from Patients with Chronic Gastritis and Gastric Ulcer Disease
Helicobacter pylori colonizes the human gastric mucosa, leading to a spectrum of gastric diseases in susceptible populations. Here we announce the draft genome sequences of strains HPARG8G and HPARG63. The data for both genome sequences provide insights regarding the diversity in gene content and rearrangement of the genomic islands commonly harbored by H. pylori.
متن کاملA Systematic In-silico Analysis of Helicobacter pylori Pathogenic Islands for Identification of Novel Drug Target Candidates
BACKGROUND Helicobacter pylori is associated with inflammation of different areas, such as the duodenum and stomach, causing gastritis and gastric ulcers leading to lymphoma and cancer. Pathogenic islands are a type of clustered mobile elements ranging from 10-200 Kb contributing to the virulence of the respective pathogen coding for one or more virulence factors. Virulence factors are molecule...
متن کاملGenomic Comparison of cag pathogenicity island (PAI)-positive and -negative Helicobacter pylori strains: identification of novel markers for cag PAI-positive strains.
In an analysis of Helicobacter pylori genomic DNA by macroarray methodology, genomic DNA from a panel of cag pathogenicity island (PAI)-negative H. pylori clinical isolates failed to hybridize with 27 genes located outside the cag PAI in a cag PAI-positive reference strain. PCR analyses confirmed that HP0217 (encoding a lipopolysaccharide biosynthetic protein) and HP1079 (encoding a protein of ...
متن کامل